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Abstract 

Background: MicroRNAs (miRNAs) and short interfering RNAs (siRNAs) are two major classes of small RNAs. They 
play important regulatory roles in plants and animals by regulating transcription, stability and/or translation of 
target genes in a sequence-complementary dependent manner. Over 4,000 miRNAs and several classes of siRNAs 
have been identified in plants, but in tobacco only computational prediction has been performed and no tobacco- 
specific miRNA has been experimentally identified. Wounding is believed to induce defensive response in tobacco, 
but the mechanism responsible for this response is yet to be uncovered. 

Results: To get insight into the role of small RNAs in damage-induced responses, we sequenced and analysed 
small RNA populations in roots and leaves from wounding or topping treated tobacco plants. In addition to 
confirmation of expression of 27 known miRNA families, we identified 59 novel tobacco-specific miRNA members 
of 38 families and a large number of loci generating phased 21- or 24-nt small RNAs (including ta-siRNAs). A 
number of miRNAs and phased small RNAs were found to be responsive to wounding or topping treatment. 
Targets of small RNAs were further surveyed by degradome sequencing. 

Conclusions: The expression changes of miRNAs and phased small RNAs responsive to wounding or topping and 
identification of defense related targets for these small RNAs suggest that the inducible defense response in 
tobacco might be controlled by pathways involving small RNAs. 



Background 

Small RNAs are a group of regulatory molecules that fall 
into two major classes, microRNAs (miRNAs) and short 
interfering RNAs (siRNAs). They play important roles in 
biological systems in eukaryotes by suppressing expression 
of target genes at the transcriptional and/or post-tran- 
scriptional level through specific base pairing with their 
targets [1]. In plants, siRNAs are further classified into 
trans-diCtmg siRNAs (ta-siRNAs), natural antisense tran- 
script-derived siRNAs (nat-siRNAs), and repeat-associated 
siRNAs (ra-siRNAs) [2]. In addition, a novel class of bac- 
teria-induced 30- to 40-nt endogenous small RNAs, long 
siRNAs (IsiRNAs), was identified in Arabidopsis [3]. 

As an important group of small RNAs, miRNA has 
attracted much attention. A number of studies have been 
performed to reveal the biogenesis of miRNAs and the 
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mechanisms of miRNA-mediated gene regulation [4-6]. 
In plants, miRNA derives from primary miRNA tran- 
script (pri-miRNA), which is transcribed by RNA poly- 
merase II. After formation of a stem-loop secondary 
structure [7,8], pri-miRNA is cleaved twice by DICER- 
LIKEl (DCLl), a RNase III enzyme [9]. The first cleavage 
of DCLl releases a miRNA precursor (pre-miRNA), and 
the second cleavage of DCLl generates a pair of miRNA/ 
miRNA'' duplex. The mature miRNA targets mRNA by 
being incorporated into the RNA-induced silencing com- 
plex (RISC) and plays a role in regulation of gene expres- 
sion [4], while miRNA'', the complementary strand of the 
mature miRNA, is usually degraded and applies no func- 
tion [10] although functional miRNA'' has been reported 
[11]. In plants, miRNAs have been demonstrated to be 
involved in many biological and metabolic processes, 
including developmental regulation, growth control, cell 
differentiation, signal transduction, and biotic and abiotic 
stresses [5,12-15]. For example, miR172 regulates floral 
organ identity and flowering time by cleavage and/or 
translational repression of AP2-domain containing genes 
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[16-18]; miR159 has been shown to be required for nor- 
mal anther development by mediating the expression of 
GAMYB-related genes [19]. 

Unlike miRNAs, endogenous siRNAs arise from loci, in 
which double-stranded RNAs are able to be formed by 
the action of RNA-dependent polymerase or by two con- 
vergent transcripts. ta-siRNAs are 21-nt phased small 
RNAs and function like miRNAs. TAS3 is conserved in 
many plant species. In Arabidopsis and rice four and 
three TAS loci have been identified, respectively [20-22]. 
Besides ta-siRNAs, other types of phased small RNAs 
have also been reported recently, such as miRNA-like 
siRNAs from long hairpin [23] and phased small RNAs 
from superclusters flanked by a common 22-nt motif 
targeted by miR2118 [24]. 

As the major compounds of tobacco {Nicotiana taha- 
cum), nicotine alkaloids have defensive functions in 
response to pathogen infection and herbivore wounding. 
Studies on alkaloids have been focused on the harm of 
smoking cigarettes and addictions to nicotine [25,26]. The 
potential use of nicotine alkaloids as insecticides in agri- 
culture has also been investigated [27]. Nicotine alkaloids 
are synthesized in tobacco roots and delivered to leaves 
through the plant transport systems, consequently nicotine 
alkaloids are mainly accumulated in leaves in most Nicoti- 
ana species [28,29]. Nicotine biosynthesis can be induced 
through topping or leaf wounding to tobacco plants 
[30,31] and significant increase of nicotine content was 
observed in the wounded tobacco plants [32] . In the past 
20 years, a series of genes involved in the nicotine biosyn- 
thetic pathways have been identified, e.g. PMT (putrescine 
A/^-methyltransferase) [33,34]; MPO (A^-methylputrescine 
oxidase) [35]; QPT (quinolinate phosphoribosyltransferase) 
[36,37] and A622 [38]. However, it is unclear whether 
samll RNAs are responsive to wounding and are involved 
in induction of nicotines upon wounding. 

Over 4,000 miRNAs (miRBase Release 16, http://www. 
mirbase.org) and several kinds of siRNAs have been iden- 
tified in plants. Recently, identification of known con- 
served miRNAs has been reported in tobacco [39,40]. 
However, no tobacco-specific miRNA has been experi- 
mentally isolated. In this study, we intend to experimen- 
tally indentify miRNAs and siRNAs in the tobacco 
genome, especially small RNAs responsive to damages 
through high-throughput sequencing of small RNA popu- 
lations from wounding or topping treated tobacco plants. 
To identify potential targets of small RNAs, a degradome 
from wounding treated plants was sequenced. As a result, 
159 known and novel tobacco-specific miRNAs together 
with a number of loci generating phased small RNAs were 
identified. Expression levels of some of these small RNAs 
were changed after wounding or topping treatment, sug- 
gesting that these small RNAs may play a defensive role in 
response to wounding or topping damage in tobacco. 



Results 

Deep sequencing reveals a diverse set of tobacco small 
RNAs 

To get insight into the composition of small RNA popu- 
lations and their changes after damage treatments in 
tobacco, we isolated small RNAs from roots and leaves of 
leaf-wounding-treated plant, roots of topping-treated 
plant and roots of untreated plant. Small RNAs were 
then sequenced using the lUumina high-throughput 
sequencing technology. A total of 18.5 million of raw 
reads (15-35 nt) were generated from four samples 
(Table 1). Similar to previous reports on small RNA clon- 
ing in other plant species, 20-24-nt reads were dominated 
in all four small RNA populations with the 24-nt small 
RNAs the most abundant species (Figure 1). Of the 20- 
24-nt reads, 42.2% and 20.7% were perfectly mapped to 
the Tobacco Genome Initiative (TGI) GSS and EST data- 
sets (see Methods). Many small RNAs were singletons or 
observed only in one sample. Less than 3% of the small 
RNA reads were common in all four samples. These 
results illustrate the complexity of the tobacco small 



Table 1 Summary of high-throughput sequencing of 
tobacco small RNAs from root and leaf 



Category 




Root 




Leaf 




Control 


Topping 


Wounding 


Wounding 


Total 










Raw reads 


3,074,948 


5,186,473 


5,510,747 


4,722,862 


Unique raw reads 


1,018,809 


1,959,287 


3,083,036 


2,178,969 


Mapped to GSS* 










Reads 


1,277,575 


2,632,084 


2,700,532 


2,469,534 


Specific in 
eacli library 


206,310 


477,279 


803,960 


491,809 


Conserved in 
all libraries 


782,079 


1,667,977 


1,085,172 


1,312,822 


Unique 


252,425 


542,516 


965,547 


687,949 


Specific in 
each libraries 


148,981 


385,469 


702,522 


442,337 


Conserved in 
all libraries 


25,780 


25,780 


25,780 


25,780 


Singleton 


202,567 


417,905 


761,185 


534,409 


Mapped to EST* 










Reads 


764,235 


1,293,204 


1,218,970 


1,528,346 


Specific in 
each libraries 


43,758 


99,553 


137,509 


107,459 


Conserved in 
all libraries 


563,565 


985,513 


811,452 


1,165,580 


Unique 


72,750 


129,710 


186,601 


149,454 


Specific in 
each libraries 


32,875 


77,511 


116,666 


84,638 


Conserved in 
all libraries 


14,001 


14,001 


14,001 


14,001 


Singleton 


48,486 


88,566 


131,515 


102,406 



#Only 19-25-nt reads were counted 
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Figure 1 Length distribution of small RNAs from different tissues and treatments. Y-axis: number of normalized small RNA reads (xlO^). 



RNA populations, which might partly be due to tetra- 
ploid characteristic of tobacco. 

Identification of miRNAs and loci generating phased small 
RNAs 

mi RNA identification 

We searched miRBase for tobacco homologs of known 
miRNAs. As a result, homologs of 27 known miRNA 
families were found in our dataset. Based on BLAST 
search against the tobacco GSS and EST databases, we 
found that the mature miRNAs of these 27 known 
miRNA families match to 100 distinct GSS or EST 
sequences. All these sequences are able to form stem-loop 
structures; therefore they were considered as pre-miRNAs 
(Additional file 1: Table SI). Although 21 of these 27 
miRNA families has been shown to be conserved in 
tobacco previously using a computational approach [31], 
our experimental approach confirmed the conservation of 
additional five miRNAs (miR408, miR477, miR1919, 
miR2118 and miR2911) in tobacco, which were not found 
computationally. Forty-four computationally identified 
tobacco miRNA families were not detected in our dataset, 
which could be because these miRNAs are lowly expressed 
and remain undetectable at the sequencing depth in our 
experiments or alternatively they are expressed in specific 
tissues that are not sampled in our experiments. 

To identify novel tobacco miRNAs, we developed a 
miRNA discovery pipeline (see Methods). Using the cus- 
tom-designed pipeline, we identified 59 novel tobacco 
miRNAs belonging to 38 families (Table 2; Additional file 
1: Table SI). Of these 38 novel miRNA families, 20 
families (53%) had a 21-nt mature miRNA sequences. Of 
these 20 families, 14 families (70%) had a mature miRNAs 



started with U. These results well supported the miRNA 
identity of these newly identified miRNAs. Of the 38 novel 
miRNA families, 13 families (34%) had a 22-nt mature 
miRNAs, which have been shown to be the secondary 
siRNA triggers [23,41]. Whether these 22-nt novel miR- 
NAs play a role in the biogenesis of secondary siRNAs is 
waiting to be investigated. Twenty-seven of the 38 novel 
miRNA families contain only one member, suggesting 
their relatively recent origin. 
Identification of loci generating phased small RNAs 
To identify loci generating phased small RNAs, small RNA 
clusters were first defined based on the number of small 
RNAs aligned to a certain length of sequence segment and 
the relative distance between neighbouring small RNAs 
[24]. Using the criteria mentioned in Methods, we identi- 
fied 1,890-15,252 21-nt clusters and 11,726-107,765 24-nt 
clusters in four samples (Table 3). As expected, there were 
more 24-nt clusters than 21-nt clusters. These small RNA 
clusters were then analysed using the methodology 
described previously to see whether the small RNAs within 
these clusters distributed in a phased pattern, which was 
determined by a phase score [42]. A higher phase score 
indicates a stronger phase signal. With a phase score of 
1.4 as suggested previously [42], we found that 189-3,098 
and 1,359-20,112 loci were able to produce 21-nt and 24- 
nt phased small RNAs in four samples, respectively. The 
number of loci generating phased small RNAs was signifi- 
cantly reduced when the phase score was increased to 10 
(Table 3). These loci are shown in Additional file 2: Table 
S2, and five representative loci generating 21-nt- or 24- nt- 
phased small RNAs are shown in Figure 2. 
In plants, the typical loci generating phased siRNAs are 
TASs [20-22]. TAS3 with the miR390 cleavage sites has 
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Table 2 Novel tobacco mIRNAs Identified In this study 

Novel miRNAs Size (nt) Pre-miRNA hit Number of the mature miRNA reads (RPM) Mature miRNA sequence 
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0.2 


1.3 


0.1 


0.1 


AAAATGTGGCCGGATACGTGT 


Nta-miR25 


22 


FH081 190.1 


2.6 


20.2 


389.8 


12.9 


TGAACTCTCTCCCTCAATGGCT 


Nta-miR26a 


21 


ET837426.1 


35.1 


179.9 


2.2 


0.4 


ATOTOCATGTAACACTGGC 


Nta-miR26b 


21 


FH393496.1 


483.9 


1146.6 


1.5 


1.3 


ATOTOCATGTAGCACTGGC 


Nta-miR27a 


22 


FH905151.1 


0.7 


1.3 


0.2 


0.1 


AAGTOGAmGTACGAAGGGC 


Nta-miR27b 


22 


FH91 5892.1 


0.7 


1.3 


0.2 


0.1 


AAGTOGAmGTACGAAGGGC 


Nta-miR27c 


22 


FH918314.1 


0.7 


1.3 


0.2 


0.1 


AAGTOGAmGTACGAAGGGC 


Nta-miR28 


22 


FH253762.1 


11.4 


11.4 


13.4 


12.3 


TAGCATAGAATOTCGCACCTA 


Nta-miR29 


21 


FH324206.1 


0.0 


2.1 


0.0 


0.0 


ACGGGTGCGGCTACA^GG 


Nta-miR30 


21 


FI086539.1 


16.6 


7.7 


2.2 


0.2 


ATCGTAACATATAGCACTAGC 
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Table 2 Novel tobacco mlRNAs Identified In this study (Continued) 



N ; DO 1 
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24 


CTQ O Q ^ 1 r> 1 
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0.0 
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Nta-miR32 


21 


FH560937.1 


0.0 


0.0 


1.6 


1.7 


TGAACTCCAGCATATOTACT 


Nta-miR33a 


24 


FH761 979.1 


0.2 


04 


4.0 


5.3 


GCTGGACCGGTATACmGCTGAC 


Nta-miR33b 


24 


FH762383.1 


0.2 


04 


4.0 


5.3 


GCTGGACCGGTATACmGCTGAC 


Nta-miR34 


22 


ET840416.1 


1.6 


24.7 


13.2 


7.8 


TOCCGACTCCCCCCATACCAC 


Nta-miR35 


21 


PHI 19649.1 


0.0 


0.0 


54 


1.1 


AGAAAAATGGTAGCCATOGA 


Nta-miR36 


24 


FH408012.1 


0.0 


0.0 


21.6 


27.3 


AATATACTGGAGTOGGTGCACCT 


Nta-miR37 


22 


FH666318.1 


0.0 


0.0 


3.3 


04 


TGGAAGTACTGCCTAAGmGA 


Nta-miR38a 


21 


FH681 839.1 


0.8 


1.0 


0.5 


0.1 


TCACATAAATOAAACGGAGG 


Nta-miR38b 


21 


FH968988.1 


0.8 


1.0 


0.5 


0.1 


TCACATAAATOAAACGGAGG 



See Additional file 1: Table SI for the detailed pre-miRNA sequences and their secondary structures 



been found to be conserved in plants, including tobacco. 
In this study, we experimentally confirmed the conserva- 
tion of TAS3 in tobacco. At least one TAS3 gene (GSS 
acc. no. FH434354) with two miR390 binding sites and 
generating the conserved ta-siARFs was confidently iden- 
tified with a phase score of 31.3. We searched targeting 
sites of the newly identified miRNAs in this study in loci 
with a phase score over 10, and found several such loci 
were potentially targeted by Nta-miR15, Nta-miR22, Nta- 
miR31, Nta-miR33, Nta-miR36 or Nta-miR37 (Additional 
file 2: Table S2). Whether these novel miRNAs are 
involved in biogenesis of the phased small RNAs 
observed is waiting to be investigated experimentally. 

Identification of small RNA targets by degradome 

sequencing 

Targets of miRNAs 

Recently, degradome sequencing has been successfully 
applied to identify targets of miRNAs in Arabidopsis and 
rice [43,44]. Taking the advantage of this new approach, 
we sequenced a degradome library constructed using 
total RNA isolated from roots of leaf-wounding treated 
tobacco plants. Over 14 million of raw reads were gener- 
ated. Out of 685,411 unique clean reads (excluding reads 
with low quality and mapped to known house-keeping 
noncoding RNAs), representing -11.8 million of raw 
reads, 55.2% (378,597) and 56.5% (387,518) were mapped 
to tobacco ESTs and GSSs, respectively. These mappable 



reads were then used for target gene identification using 
the CleaveLand pipeline as described [43,45]. 

One or more targets were uncovered for 12 conserved 
and five tobacco-specific miRNA families (Additional file 
3: Table S3). These targets were classified into five cate- 
gories following the description indicated in the software 
CleaveLand 2. Six representative miRNA targets identified 
by degradome sequencing were shown in Figure 3. Four- 
teen targets of seven miRNA families were fallen into 
Category 0 and 1, for which the most abundant cleavage 
signal was detected at the expected miRNA-mediated clea- 
vage sites. Functions of the conserved miRNA target genes 
are evolutionary conserved between different plants, 
including tobacco. For example, miR160 has been shown 
to regulate expression of ARF (auxin-response factor) 
genes in Arabidopsis and tomato [46,47]. Our results 
showed that ARF is one of the potential targets of miR160 
in tobacco, suggesting a conserved role of miR160 in 
tobacco. Similarly, conserved targets were also identified 
for miR171 (hairy meristem gene, a SCARECROW-like 
transcription factor) and miR172 (AP2- domain containing 
genes) (Additional file 3: Table S3). However, conserved 
miRNAs might have non-conserved function in tobacco, 
for instance, salicylic acid-induced protein 19, which has 
been reported to have multiple functions in defense 
response in tobacco [48], was the only target found for 
miR164, suggesting that miR164 could be part of the 
defensive pathway involving the salicylic acid signal. 



Table 3 Number of small RNA clusters and loci generating 21 -nt- or 24-nt-phased siRNAs 

Root Leaf 

Category Phase size (nt) Control Topping Wounding Wounding 

Small RNA clusters 21 1890 5116 15252 6559 

24 11726 37000 107765 84270 

Phased siRNA generating loci (P > 1.4) 21 189 919 3098 973 

24 1359 5244 20112 14681 

Phased siRNA generating loci > 10) 21 13 50 220 57 

24 7 115 691 508 



Loci homologous to known house-keeping noncoding RNAs, such as rRNA, tRNA, snRNA and snoRNA and overlapped with repeat sequences were excluded. The 
phase scores (P values) were calculated following the description in Howell et al. [42] 
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Figure 2 Examples of loci generating phased 21 -nt small RNAs (FH494072 and FH698163) and phased 24-nt small RNAs (ET713388, 
ET712651 and ET977252). The middle and the top graph in each locus represent the distribution of small RNA reads across the whole locus and the 
phase signals generated from the small RNA reads, respectively. Each arrow underneath the plot graph represents a 21-nt or 24-nt phase with the 
arrow direction indicating the forward or the reverse strand. The red arrows represent the phase with the highest number of small RNA reads. Target(s) 
of the phased small RNA corresponding to the red arrow, identified by degradome sequencing, are shown at the bottom of each graph. 
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Figure 3 Plot signals of the candidate targets of miRNAs predicted by degradome sequencing. Targets of conserved miRNAs (miR160, 
miR171, miR172 and miR398) and tobacco-specific miRNAs (Nta-miR22 and Nta-miR32) are sliown. Tine X-axis displays tine nucleotide position of 
the target genes. The Y-axis indicates the abundance of reads converted into transcripts per 10 million (TPIOM). Each circle represents a 
degradome fragment mapped to the target gene and the circle indicated by the red arrow represents the expected miRNA cleavage product. 
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The newly identified miRNAs were found to have 
diverse targets, but most of them seem to have a role in 
stress responses, suggesting that most of the newly iden- 
tified miRNAs could function as regulators of defensive 
pathways in tobacco. For example, our degradome 
sequencing result showed that Nta-miRlO targets a gene 
encoding Avr9/Cf-9 rapidly elicited protein 4 (ACRE4), 
which plays a role in resistance to wounding and 
mechanical stress in tobacco [49]. Nta-miR38 targets 
genes encoding nematode resistance-like protein and 
ankyrin repeat containing protein, both are induced 
upon infection of disease [50,51]. 
Targets of phased small RNAs 

One of the phased small RNAs generated from the TAS3 
locus is ta-siARF, which targets ARF genes that influence 
leaf morphology and lateral root growth [52,53]. To know 
whether the phased small RNAs identified in this study, 
like ta-siARF, are able to regulate expression of other 
genes, we checked the presence of potential targets of the 
most abundant 30 phased small RNAs observed in the 
damage-treated samples in our degradome sequencing 



dataset. We found targets for five (two 21-nt and three 
24-nt) of the 30 selected phased small RNAs (Figure 2; 
Additional file 4: Table S4). For example, the D2(-) phased 
small RNA from locus FH494072, which was specifically 
detected in wounding or topping treated samples, was 
found to target an hypothetical protein; the D4(+) phased 
small RNA from locus ET7 12651 was found to target sev- 
eral genes with diverse functions, including putative dis- 
ease resistance protein, dicer-like protein and type-A 
phytochrome. 

We did gene ontology (GO) analysis for the target 
genes of miRNAs and phased small RNAs. Although 
miRNAs and phased small RNAs had most of their tar- 
gets grouped into the same GO terms, including response 
to stimulus and immune system process, it seems that 
miRNAs and the phased small RNAs are targeting speci- 
fic genes involved in reproduction (for miRNAs) and cel- 
lular component biogenesis and organization (for the 
phased small RNAs), respectively (Figure 4), suggesting a 
unique role of miRNAs and the pahsed small RNAs in 
these processes. 
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Figure 4 Gene ontology analysis of the targets of miRNAs and the phased small RNAs. 
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Small RNAs responsive to topping and wounding 
treatments 

Topping and wounding responsive miRNAs 

In order to compare expression levels of small RNAs in 
different samples, the abundance of each unique small 
RNA was normalized by converting the absolute read 
numbers to relative read numbers (reads per million) 
based on the total number of reads in each library. To get 
insight into the potential role of miRNAs in response to 
damage treatments, we compared the expression changes 
of all miRNAs detected in our datasets. Most of the miR- 
NAs identified in this study changed their expression 
upon topping or wounding treatment. Eight conserved 
and seven newly identified miRNAs showed significant 
up-regulation or down- regulation upon damage treatment 
(Figure 5). Out of the eight conserved miRNAs, miR159, 
miR164, miR167 and miR172 have been previously shown 
to be stress-responsive [15]. miR164 and miR168 were sig- 
nificantly induced in roots by both topping and wounding 
treatments. miR172 and miR390 were significantly 
induced only in roots treated by wounding. Furthermore, 
miR159, miR319 and miR2911 were up-regulated only in 
topping treated sample while were down-regulated or 
remaining unchanged in wounding treated sample. These 
results indicate different miRNA families behave differ- 
ently upon damage treatment, suggesting a different role 



of these miRNAs in response to damage treatment in 
tobacco. In addition, seven conserved miRNA families 
(miR169, miR395, miR397, miR398, miR399, miR408 and 
miR827) were not detected in the untreated tobacco root 
sample but were detected in at least one damage treated 
sample (Additional file 1: Table SI). Among these miR- 
NAs, miR395 and miR398 have been shown to response 
to a variety of stresses [15]. 

To confirm the deep sequencing results, the expression 
levels of miR156a, miR157 and miR164a were analysed 
using northern blot. According to northern results, the 
expression levels of miR164a and miR156a were higher in 
roots than in leaves, while the expression levels of miR157 
were higher in leaves than in roots (Figure 6), consistent 
with the deep sequencing results (Additional file 1: Table 
SI); however, for miR164a, the induction detected by 
northern analysis was not as significant as that detected by 
deep sequencing; for miR156a and miR157, a discrepant 
result was observed in the wounding treated root samples 
between deep sequencing and northern analysis (Figure 6; 
Additional file 1: Table SI). The induction observed in 
deep sequencing results could be an artifact caused by 
biased adaptor ligation and PGR amplification during the 
procedure of small RNA library construction. 

Out of the 38 newly identified miRNA families, 27 
were relatively highly expressed, particularly Nta-miR9, 



Topping/CK Wounding/CK 

miRI 68 
miRI 64a 
miRI 67a 
Nta-miR25 
Nta-miR1 
miRI 67b 
Nta-miR4c 
miRI 72a 
miR390a 
Nta-miR12 
miR479 
Nta-miR9a 
miRI 59 
Nta-miR23 
Nta-miR26 

Figure 5 Heat map showing miRNAs with a significant expression change upon wounding or topping treatment in roots. 
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Figure 6 Northern blot analysis of miRNA expression. LC: control leaves; RC: control roots; RW: roots from leaf wounded plants; RT: roots 
from topping treated plants; LW: leaves from leaf wounded plants. Tobacco U6 was served as a loading control. 



Nta-miR17 and Nta-miR26 (Table 2). Northern analysis 
of Nta-miR9 showed that this novel miRNA is highly 
expressed in roots but lowly expressed in leaf, consistent 
with the deep sequencing results. Nta-miR9 was highly 
induced by wounding in leaves and slightly induced in 
roots from the topping treated plants (Figure 6), but a 
reduction of this miRNA in roots from the wounding 
treated plants detected by deep sequencing was not 
detected by northern analysis, which could be because 
of cross hybridization of other members of this miRNA 
family that failed to be detected by deep sequencing. 
The discrepancy observed between deep sequencing and 
northern blot analysis suggests that caution must be 
taken when directly compare small RNA expression 
levels observed by either approach. 

Nta-miRl, Nta-miR4c and Nta-miR25 were highly 
induced in roots by both wounding and topping treat- 
ments; Nta-miR26 was highly induced only in topping 



treated sample (Figure 5). Nine newly identified miRNA 
families (Nta-miR15, Nta-miR21-22, Nta-miR29, Nta- 
miR31-32 and Nta-miR35-37) were not expressed in the 
untreated roots but were detected in damage treated sam- 
ples. Different members of the same miRNA family could 
behave differently upon damage treatment, for example, 
Nta-miR4a.2 was highly induced by damage treatment, 
whereas Nta-miR4a.l and miR4b were not induced. 
Topping and wounding responsive phased small RNAs 
Generally, the number of small RNA clusters was much 
higher in the damage treated samples (Table 3), indicating 
a number of small RNAs could be induced by these treat- 
ments. The number of loci generating both 21-nt- and 
24-nt-phased small RNAs increased dramatically after top- 
ping or leaf-wounding treatment. Some of these loci were 
found in two, three or all four samples, but some of them 
were unique to each treatment (Figure 7). Similar to what 
has been found for miRNAs, wounding treated root 
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Figure 7 Venn diagram showing the number of loci (with a phase score over 1.4) generating phased small RNAs in four libraries. Blue and 
red numbers represent the number of loci generating 21 -nt- and 24-nt-phased small RNAs, respectively. 



sample had the most abundant uniquely up-regulated loci 
that generate phased small RNAs (Figure 7). Two phased 
small RNAs were analysed using northern blot, but no sig- 
nal was detected (data not shown), mainly due to their low 
expression levels. 

Discussion 

Recent computational prediction identified 259 potential 
conserved tobacco miRNAs of 65 families [39], but com- 
putational prediction based on known miRNAs is unable 
to identify novel miRNAs and miRNAs expressed under 
specific conditions, such as biotic and abiotic stresses. In 
this study, we used high-throughput sequencing approach 
to experimentally confirm expression of the known miR- 
NAs and their changes upon wounding or topping 



treatment, and to identify novel tobacco-specific miRNAs. 
Our study detected the expression of 100 known miRNAs 
(belonging to 27 families) in tobacco roots and leaves, 
including miR408, miR477, miR1919, miR2118 and 
miR2911 that was not previously computationally identi- 
fied. More importantly, our work identified 59 novel 
tobacco-specific miRNAs (belonging to 38 families). 

Upon damage treatments, eight conserved and seven 
newly identified miRNAs showed significant up-regula- 
tion or down-regulation (Figure 5). In addition, seven 
conserved and nine newly identified miRNA families 
were only detected in the damage treated sample(s). The 
majority of these damage-responsive conserved miRNA 
families have been previously shown to be stress respon- 
sive in other plant species, suggesting that these miRNAs 
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might also play a positive regulatory role in stress-toler- 
ance in tobacco. In Arabidopsis, miR164 is essential for 
floral, leaf and root development [54,55]. We found that 
this miRNA was induced in roots upon wounding or top- 
ping treatment (Table 2; Figure 6) and that one of the 
targets encodes a salicylic acid-induced protein; there- 
fore, it is possible that miR164 is required for homeosta- 
sis of defensive proteins in tobacco. In addition, most 
targets of the newly identified miRNAs responsive to top- 
ping or wounding, found in the degradome sequencing 
results, seem to be regulators of various defensive path- 
ways, implicating the importance of these miRNAs in 
stress responses in tobacco. 

Different damages seem to have different effects on the 
expression of miRNAs and siRNAs. The abundance of 
small RNAs in the wounding treated leaf sample was 
generally much higher than that in the topping and 
wounding treated root samples, suggesting that the 
induction of small RNAs is more significant in the tissue 
where the damage was directly applied. Induction of 
small RNAs in roots collected from the leaf-wounding 
plants suggests that a mobile signal responsible for this 
induction is generated in leaves and transported to roots. 
This signal is most likely a type of phytohormone, such 
as salicylic acid, jasmonic acid or ethylene. However the 
exact nature of this signal is waiting to be resolved. 

Silencing of RNA-directed RNA polymerase 1 (RdRl) 
makes Nicotiana attenuata highly susceptible to insect 
herbivores, suggesting the defense elicitation in Nicotiana 
attenuata is under the direct control of small RNAs [56] . 
Further investigation has found large-scale changes of 
small RNAs associated with insect elicitation [57]. 
Wounding is similar to insect herbivores and induction 
of small RNAs in the wounding treated tobacco plants is 
consistent with this recent finding. This result suggests 
that small RNAs could play a central role in reprogram- 
ming the tobacco transcriptome in response to biotic and 
abiotic stresses. It is unclear how the stress-transcrip- 
tome of tobacco has changed upon wounding or topping, 
but our work has provided bases for further investigation 
on roles of small RNAs in stress responses in tobacco. 

In plants, TASs are the noncoding transcripts targeted 
by miRNAs to generate phased small RNAs. Recent 
investigation has demonstrated that the 22-nt miRNAs, 
rather than the 21-nt miRNAs, are the triggers for pro- 
duction of secondary phased small RNAs [41]. Out of 
the 38 newly identified miRNA families, 13 families 
have a 22-nt miRNA. In addition, we found that a large 
number of genomic regions in tobacco are able to pro- 
duce phased 21-nt or 24-nt small RNAs and that a 
number of these regions containing potential binding 
sites of the newly identified miRNAs. Although we did 
not find these binding sites are enriched for 22-nt miR- 
NAs, it is worth to perform further detailed analysis to 



see if the 22-nt miRNAs found in this study play a role 
in biogenesis of the secondary phased small RNAs. 

Conclusions 

Our study experimentally confirmed the conservation of a 
number of known miRNAs in tobacco, including con- 
served miRNAs that were not previously identified by 
computational prediction. Identification of damage- 
responsive miRNAs and phased small RNAs and defense 
related targets of these small RNAs suggest that the indu- 
cible defense responses in tobacco might be controlled by 
pathways involving small RNAs. 

Methods 

Plant materials and sample preparation 

All tissue samples were collected from tobacco cultivar. 
Hicks Broad Leaf. Plants were grown in the growth room 
with a temperature of 22-25°C in order to minimize exter- 
nal affects on the biosynthesis of nicotine alkaloids. Three 
40-day-old plants with roughly identical size were selected 
for the experiment. One was used for mechanical leaf 
wounding by making punctures on at least five fully 
expanded leaves, one was used for the topping treatment, 
and the third one as control (CK). After damage treat- 
ments, plants were kept for another 48 hours in the 
growth room before sample collection to allow occurring 
of damage induced responses. Roots were collected from 
the wounding and topping treated plants as well as the 
control plant. Leaf samples were collected from the 
wounding treated and control plants. 

RNA isolation, small RNA and degradome sequencing 

Tissues were ground into a fine powder in liquid nitrogen 
and total RNAs were extracted using the TRIzol® 
Reagent kit (Invitrogen). Four small RNA libraries (leaves 
and roots from the wounding treated plant, roots from 
the topping treated and the control plants) were con- 
structed using the Small RNA Sample Prep Kit (Illumina) 
[58,59]. Briefly, small RNAs were fractionated on the 15% 
polyacrylamide, 8 M urea gel. The 18-25 nt portion of 
the gel was excised and small RNAs were eluted and pur- 
ified using the nucleic acid purification kit (Axygen). 
Small RNAs were then sequentially ligated with a 5' RNA 
adapter (5'-GUUCAGAGUUCUACAGUCCGACGAUC- 
3') and a 3' RNA adapter (5'-pUCGUAUGCCGUCUU- 
CUGCUUGUidT-3'). cDNA libraries were constructed 
through reverse transcription using Superscript^^ III 
(Invitrogen) and enriched by 15 cycles of PGR. The final 
PGR products were then purified using the PureLink™ 
PGR Purification Kit (Invitrogen) and sequenced using 
the SOLEXA genome analysis system. Primers used were: 
5'-CAAGCAGAAGACGGCATACGA-3' (RT primer), 5'- 
CAAGCAGAAGACGGCATACGA-3' (smaU RNA PGR 
primer 1), 5'-AATGATAGGGGGAGGAGGGAGAGGT 
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TCAGAGTTCTACAGTCCGA-3' (small RNA PGR pri- 
mer 2) and 5'-CGACAGGTTCAGAGTTCTACAGTCC- 
GACGATC-3' (small RNA sequencing primer). 

The degradome library was constructed according to a 
published protocol [60]. Briefly, RNA fragments with a 
poly(A) tail were isolated from total RNA of wounding 
treated leaves using the Oligotex mRNA mini kit (Qia- 
gen), and then a 5' RNA adapter with a Mmel restric- 
tion site at its 3' end was added to the 5' ends of the 
isolated poly(A) RNAs. After reverse transcription using 
oligo d(T) and PGR enrichment, the PGR products were 
purified and digested with MmeL After ligating a dou- 
ble-stranded DNA adapter to the 3' end of the digested 
products, the ligated products were further purified and 
amplified, and then sequenced using the lUumina GA II 
platform. 

Tobacco sequences and other databases 

The tobacco reference genomic (1,420,578 GSSs) and 
EST (80,783 generated from tobacco cultivar Hicks 
Broad Leaf by TGI) sequences were retrieved from Gen- 
Bank. The nt database were downloaded from GenBank 
for annotation of the candidate miRNA genes and phased 
siRNA loci. Rfam 9.1 (http://rfam.janelia.org/) and 
RepBase 14.03 (http://www.girinst.org) were used for fil- 
tering known RNAs and repeat sequences. Transcription 
factor sequences of tobacco were retrieved from http:// 
compsysbio.achs.virginia.edu/tobfac/ [61]. Genes involved 
in nicotine pathways were collected from previous stu- 
dies [26-32]. miRNAs in miRBase 16.0 were used for 
identification of tobacco homologs of the known miR- 
NAs (http://www.mirbase.org/). 

miRNA identification 

All small RNA data were processed by a suite of perl 
scripts. To identify candidate miRNAs, a pair of small 
RNAs located at the same GSS or EST and no more than 
400-nt apart were selected and the sequence between 
these two small RNAs was subjected to prediction of 
stem-loop structure using the Vienna RNA package [62]. 
Candidates (pre-miRNAs) with an ideal hairpin structure 
containing the pair of small RNAs that is able to form a 
miRNA:: miRNA'' duplex with less than 4 mismatches and 
2-nt of 3' overhangs were selected. Small RNAs aligned to 
these selected candidates were checked to eliminate the 
candidates with a smear distribution pattern of the small 
RNAs. For a certain miRNA family, candidates with any 
mismatch in mature miRNAs or more than 3 mismatches 
in pre-miRNAs were treated as different members. Redun- 
dant pre-miRNA sequences were eliminated. 

Identification of loci generating phased small RNAs 

The locations of small RNA clusters were pinpointed 
with the definition of a small RNAs cluster as a 



segment of sequence mapped by at least 9 distinct 
small RNAs and without other small RNAs in its 
flanking 100-bp regions [42]. A pipeline containing 
several perl scripts were then employed to identify loci 
generating phased siRNAs from the identified small 
RNA clusters [24]. 

Target identification by degradome sequencing 

After adaptor trimming and filtering out the reads 
mapped to house-keeping noncoding RNAs, such as 
tRNAs, rRNAs, snRNAs and snoRNAs, 20-21-nt high 
quality reads were kept for further analysis. Targets of 
miRNAs and phased small RNAs were identified using 
the CleaveLand pipeline [45]. Briefly, 20-21-nt small 
RNA reads were first mapped to tobacco ESTs and GSSs 
separately with the SHRiMP program. The reference 
sequence fragments of 30 nt were then spUced to make 
sure the position with degradome reads mapped was 
included. Fragments were further analyzed to see if they 
are targeted by miRNAs. The fragments with an align- 
ment cutoff value less than 7 were considered as candi- 
date targets. The target genes identified by degradome 
sequencing can be classified into five categories based on 
the relative abundance of degradome transcripts mapped 
to the expected cleavage sites and other sites. 

Northern blot analysis of miRNA expression 

Total RNA was isolated from tobacco leaves and roots 
collected from plants treated with wounding or topping 
or untreated (control) using the TRIzol® Reagent kit 
(Invitrogen). Approximate 20 (ig of total RNA was frac- 
tionated on a 15% denaturing gel for 2 hours and then 
transferred electrophoretically to the nylon membrane 
(Nytran SuperCharge membranes). Membranes were 
then UV cross-linked in a Straralinker 1800 Stratagene 
and baked for at least 2 hours at 80°C. DNA oligos 
complementary to miRNA sequences and labeled with 
digoxin were synthesized by Invitrogen. Membranes 
were prehybridized for 2 hours at 50°C using Per- 
fectHybTM Plus buffer (SIGMA) and then hybridized 
with digoxin-labeled probes for 16-20 hours at 42°C. 
Membranes were then washed three times (two times 
with 2 X SSC + 0.1%SDS and one time with 0.5 x SSC 
+ 0.1%SDS) and sealed with the Blocking Solution 
(SIGMA) containing Anti-Digoxigenin-AP (Roche) for 1 
hour. Membranes were finally incubated with Detection 
Buffer (0.1 M Tris-Cl, 0.1 M NaCl, pH9.5) and CSPD 
(Roche) for 5 min and exposed to X-ray films for 4 
hours or overnight. 

Additional material 



Additional file 1: Table 51. Tobacco miRNA families identified in this 
study. 
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Additional file 2: Table S2. Loci with significant (phase score > 10) 21- 
nt or 24-nt phase signals. 

Additional file 3: Table S3. miRNA targets identified by degradome 
sequencing. 

Additional file 4: Table S4. Identification of targets of siRNAs from the 
phased-siRNA generating loci by degradome sequencing. 
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